PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID CA01g07870
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Capsiceae; Capsicum
Family HD-ZIP
Protein Properties Length: 776aa    MW: 86330.8 Da    PI: 5.1173
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
CA01g07870genomePEPView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox62.56.3e-203388156
                TT--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
    Homeobox  1 rrkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56
                +++ +++t  q++e+e+lF+++++p+ ++r +L++ lgL+ rqVk+WFqNrR+++k
  CA01g07870 33 KKRYHRHTVRQIQEMEALFKECPHPDDKQRLKLSQDLGLKPRQVKFWFQNRRTQMK 88
                688899***********************************************998 PP

2START147.51.2e-462384651206
                 HHHHHHHHHHHHHHHHC-TT-EEEEEXCCTTEEEEEEESSS..............SCEEEEEEEECCSCHHHHHHHHHCCCGG...CT-TT-SEEEEEE CS
       START   1 elaeeaaqelvkkalaeepgWvkssesengdevlqkfeeskv.............dsgealrasgvvdmvlallveellddke...qWdetlakaetle 83 
                 ela++ ++elvk+ + +ep+W     ++ g+evl  +e s+               ++ea r s+vv+m++ +lv  +ld+++    +   + +a+t++
  CA01g07870 238 ELALSSMNELVKMCTMGEPLWIWGH-NDSGKEVLNVEEYSRMfpwpvgvkqnsneLRIEATRSSAVVIMNSITLVDAFLDTNKcieLFPSIISRAKTIQ 335
                 578999***************9998.66677777666666667777789999******************************99999999999****** PP

                 EECTT......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE..TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEEEE-EE--S CS
       START  84 vissg......galqlmvaelqalsplvp.RdfvfvRyirq.lgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtwvehvdlkg 174
                 v++sg      g lqlm++e+q+l+plv+ R+ +f+Ry++q  ++g+w+ivd  +ds  ++   +++   +++pSg++i++++ng+s+vtwveh+++++
  CA01g07870 336 VVTSGvsehasGSLQLMFMEMQVLTPLVStRECYFLRYCQQnVEEGSWAIVDFPLDSLHNNF-PAPFPYFKRRPSGCIIQDMPNGYSRVTWVEHAEVEE 433
                 *****************************************99**************99998.58888888**************************** PP

                 SXXHHHHHHHHHHHHHHHHHHHHHHTXXXXXX CS
       START 175 rlphwllrslvksglaegaktwvatlqrqcek 206
                 + + ++++ +v sg+a+ga++w++ lqrqce+
  CA01g07870 434 NPVNQIFNNFVTSGMAFGAQRWLSILQRQCER 465
                 ******************************97 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.601.9E-202191IPR009057Homeodomain-like
SuperFamilySSF466891.04E-182391IPR009057Homeodomain-like
PROSITE profilePS5007116.5683090IPR001356Homeobox domain
SMARTSM003896.3E-193194IPR001356Homeobox domain
CDDcd000861.89E-173391No hitNo description
PfamPF000461.9E-173388IPR001356Homeobox domain
PROSITE patternPS0002706588IPR017970Homeobox, conserved site
PROSITE profilePS5084840.089229468IPR002913START domain
SuperFamilySSF559617.42E-32230467No hitNo description
CDDcd088751.55E-109233464No hitNo description
SMARTSM002341.1E-25238465IPR002913START domain
PfamPF018522.6E-38238465IPR002913START domain
SuperFamilySSF559614.81E-17503661No hitNo description
SuperFamilySSF559614.81E-17698738No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0048497Biological Processmaintenance of floral organ identity
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 776 aa     Download sequence    Send to blast
MDSGSGSEHI EGMSGNELEL PEQQQQQQPP GKKKRYHRHT VRQIQEMEAL FKECPHPDDK  60
QRLKLSQDLG LKPRQVKFWF QNRRTQMKAQ QDRSDNVILR AENDNLKNEN YRLQAALRSV  120
MCPTCGGPAM LGEMGFDEQQ LRLENARLKE EFERVCCLVS QYNGRGPMQG LGSPPPLLPP  180
SLELDMSITN FPRKFEDQPT MPNCADMVPV SLLMPDQNNS QFSGGPMIME EEKSLAMELA  240
LSSMNELVKM CTMGEPLWIW GHNDSGKEVL NVEEYSRMFP WPVGVKQNSN ELRIEATRSS  300
AVVIMNSITL VDAFLDTNKC IELFPSIISR AKTIQVVTSG VSEHASGSLQ LMFMEMQVLT  360
PLVSTRECYF LRYCQQNVEE GSWAIVDFPL DSLHNNFPAP FPYFKRRPSG CIIQDMPNGY  420
SRVTWVEHAE VEENPVNQIF NNFVTSGMAF GAQRWLSILQ RQCERLASLM ARNISDLGVI  480
PSPEARKSLM NLSQRMIKTF CMNISTSYGQ SWTALSDSLE DTVRITTRKV TEPGQPNGLI  540
LSAVSTSWLP YNHFQVFDLL RDERRRAQLD VLSNGNSLHE VAHIANGSHP GNCISLLRIN  600
VASNSSQSVE LMLQESCTDD SGSLVAYTTV EVDAIQLAMN GEDPSCIPLL PLGFVITPVN  660
GQTNNINNSD NISSSEANSS AQSSEKRQNL SSIQEYSGGC LLTIGLQVLA STIPTAKLNL  720
SSVTAINHHL CNTVQQINAA LVAFYPESGI NAPSSPPPPR QPESSKQVDE NSNSS*
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankHG9754471e-130HG975447.1 Solanum pennellii chromosome ch08, complete genome.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_016573090.10.0PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1
RefseqXP_016573098.10.0PREDICTED: homeobox-leucine zipper protein HDG5 isoform X1
RefseqXP_016573102.10.0PREDICTED: homeobox-leucine zipper protein HDG5 isoform X2
SwissprotQ336P20.0ROC3_ORYSJ; Homeobox-leucine zipper protein ROC3
TrEMBLA0A0V0IWB50.0A0A0V0IWB5_SOLCH; Putative homeobox-leucine zipper protein HDG5-like
STRINGSolyc08g076370.2.10.0(Solanum lycopersicum)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
AsteridsOGEA90202226
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT5G46880.10.0homeobox-7